Substrate for the epidermal growth factor receptor kinase

ABSTRACT

A new substrate of epidermal growth factor receptor and certain other tyrosine kinase receptors denominated eps15, polynucleotides encoding epsl5, antisense eps15 polynucleotide, triple helix eps15 polynucleotide, antibodies to eps15, and assays for determining eps15.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a divisional of U.S. patent application Ser. No. 08/095,737, filed Jul. 22, 1993, now U.S. Pat. No. 5,487,979, which is a continuation in part of Ser. No. 07/935,311, filed Aug. 25, 1992, now U.S. Pat. No. 5,378,809, which applications are hereby incorporated by reference herein.

FIELD OF THE INVENTION

The present invention relates to substrates for the epidermal growth factor receptor kinase, polynucleotides encoding the substrates, and methods for using the substrates.

BACKGROUND OF THE INVENTION

The cellular machinery involved in mitogenesis is complex, and not fully understood. In general, receptors present on the cell surface bind growth factors, resulting in an activated receptor. In particular, the receptors of interest are endowed with intrinsic tyrosine kinase activity, and are known as tyrosine kinase receptors or TKRs. The activated receptors, in turn, phosphorylate intracellular substrates. These phosphorylated substrates are responsible for a series of events that leads to cell division. This process is generally referred to as "mitogenic signal transduction." The molecular machinery involved in this process is considered to be the "mitogenic signaling pathway."

Growth factors and hormones exert pleiotropic effects on cellular functions, including mitogenic stimulation and modulation of differentiation and metabolism (Ullrich et al., Cell 61: 203-212 (1990); Aaronson, Science 254: 1146-1153 (1991)). In many cases, these effects are mediated by the interaction of growth factors with cell surface tyrosine kinase receptors (TKRs), resulting in enhanced receptor catalytic activity and tyrosine phosphorylation of intracellular substrates (Ullrich et al., supra; Aaronson, supra). Data regarding the nature of these second messenger systems is still scanty, although some molecules which associate and/or are tyrosine phosphorylated by TKRs have been identified. These include the γ isozyme of phospholipase C (PLC-γ) (Margolis et al., Cell 57: 1101-1107 (1989); Meisenhelder et al., Cell 57: 1109-1122 (1989); Wahl et al., Mol. Cell. Biol. 9: 2934-2943 (1989)); the p21ras GTPase activating protein (GAP) (Molloy et al., Nature 342: 711-714 (1989); Kaplan et al., Cell 61: 125-133 (1990); Kazlauskas et al., Science 247: 1578-1581 (1990)); the raf serine-threonine kinase (Morrison et al., Proc. Natl. Acad. Sci. USA 85: 8855-8859 (1988); Morrison et al., Cell 58: 649-657 (1989)); the p85 subunit of the phosphatidylinositol 3-kinase (PtdIns-3K); (Coughlin et al., Science 243: 1191-1194 (1989); Kazlauskas et al., Cell 58: 1121-1133 (1989); Varticovski et al., Nature 342: 699-702 (1989); Ruderman et al., Proc. Natl. Acad. Sci. USA 87: 1411-1415 (1990); Escobedo et al., Cell 65: 75-82 (1991); Skolnik et al., Cell 65: 83-90 (1991); Otsu et al., Cell 65: 91-104 (1991)) and some cytoplasmic tyrosine kinases (Gould et al., Mol. Cell. Biol. 8: 3345-3356 (1988); Kypta et al., Cell 62: 481-492 (1990)). These signaling molecules are thought to mediate at least in part the mitogenic effects of TKRs (Ullrich et al., Cell 61: 203-212 (1990); Aaronson, Science 254: 1146-1153 (1991)).

However, the epidermal growth factor (EGF) receptor (EGFR) does not appear to efficiently interact with known second messenger systems (Fazioli et al., Mol. Cell. Biol. 11: 2040-2048 (1991); Segatto et al., Mol. Cell. Biol. 11: 3191-3202 (1991)). Thus, there is a need to ascertain the mechanism by which the EGFR functions in mitogenesis, and a particular need to identify and characterize the substrate (if any) of the EGFR.

Errors which occur in the mitogenic signaling pathway, such as alterations in one or more elements of that pathway, are implicated in malignant transformation and cancer. It is believed that in at least some malignancies, interference with abnormal mitogenic signal transduction could cause reversion of cells to a normal phenotype.

In addition, reagents useful in identifying molecular components of the mitogenic signaling pathway would find utility as tumor markers for therapeutic, diagnostic, and prognostic purposes. Furthermore, identification of how such components differ from normal components in malignant tissue might be of significant value in understanding and treating such malignancies. Alterations of the EGFR mitogenic signal transduction pathway have been described in several human tumors. Accordingly, substrates of the EGFR are of particular interest.

Finally, there is a need to identify reagents that can be used to determine the tyrosine kinase activity of particular samples of biological origin. Determination of the tyrosine kinase activity of samples could have value in the therapy, diagnosis, and prognosis of neoplasia and other disorders connected with abnormal mitogenic signaling pathways.

It is therefore an object of the present invention to provide reagents and methods useful in identifying components of the mitogenic signal transduction pathway, for determining tyrosine kinase activity of samples, and for determining how particular components of the pathway in abnormal tissue differ from normal components. In particular, it is an object of the invention to provide reagents and methods that relate to the EGFR substrate(s).

SUMMARY OF THE INVENTION

A method is disclosed which allows direct cloning of intracellular substrates for tyrosine kinase receptors (TKRs). By applying this technique to the study of the epidermal growth factor (EGF) receptor (EGFR) signaling pathway, a cDNA designated eps15 has been isolated. The structural features of the deduced eps15 gene product allow its subdivision into three domains. Domain I contains signatures of a regulatory domain, including a candidate tyrosine phosphorylation site and EF-hand type calcium-binding domains. Domain II presents the characteristic heptad repeats of coil-coiled rod-like proteins. Domain III displays a repeated aspartic acid-proline-phenylalanine motif reminiscent of a consensus sequence of several methylases. Eps15 does not bear the Src homology 2 (SH2) and Src homology 3 (SH3) domains, characteristic signatures of TKR substrates. Antibodies specific to the eps15 gene product recognize a protein of 142 kDa and a minor component of 155 kDa, which are phosphorylated on tyrosine following EGFR activation by EGF in vivo. In addition, phosphorylation of the eps15 gene product must be relatively receptor-specific, since erbB-2, an EGFR-related kinase, phosphorylates it very inefficiently. By employing chimeric molecules between EGFR and gp185^(erbB-2), the region of the EGFR responsible for the differential phosphorylation of eps15 could be mapped to its juxtamembrane region. Overexpression of eps15 is sufficient to transform NIH-3T3 cells, thus implicating the eps15 gene product in the regulation of mitogenic signals.

Thus, one aspect of the present invention is isolated or purified polynucleotide encoding eps15 substrate of the epidermal growth factor receptor, preferably mammalian eps15, and more preferably human eps15. The sequence can include polynucleotide coding for mouse eps15, including polynucleotide encoding the amino acid sequence of SEQ ID NO:4, the DNA sequence of SEQ ID NO:3, and a mRNA transcript of SEQ ID NO:3. Also encompassed within the scope of the invention is sequence coding for human eps15 that includes the polynucleotide encoding the amino acid sequence of SEQ ID NO:2, the DNA sequence of SEQ ID NO:1, and a mRNA transcript of SEQ ID NO:1. Moreover, the invention includes an antisense oligonucleotide and a triple helix probe capable of blocking expression of the eps15 gone product, preferably including at least 15 nucleotides.

The invention further comprises isolated or purified eps15, preferably mammalian eps15, and more preferably human eps15. Included within the scope of the invention is mouse eps15, which can have the amino acid sequence of SEQ ID NO:4. The human eps15 advantageously includes the amino acid sequence of SEQ ID NO:2. The concentration of the isolated or purified eps15 is preferably at least 1 μg/ml.

Another aspect of the invention is isolated or purified antibody to eps15, including both monoclonal and polyclonal antibody.

Yet another aspect of the invention is a construct including a vector and sequence encoding the eps15 substrate; and a host transformed therewith.

Furthermore, the invention features a method for enhancing the mitogenic response of cells to mitogenic factors, including the step of administering to the cells an effective mitogenic-response enhancing amount of eps15.

Another aspect of the invention is a method for determining TKR tyrosine kinase activity in a biological sample, including the steps of combining eps15 with the sample, and measuring tyrosine phosphorylation of the eps15 by the TKR tyrosine kinase in the sample.

The invention also provides a method for determining eps15 in a sample, including the steps of contacting the sample with antibody to eps15, such that an immunological complex forms between eps15 and the antibody, and detecting the formation of the immunological complex.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 presents an analysis of the heptads in the second of the three structural domains characterizing eps15 deduced from the the mouse cDNA. The amino acid position in each heptad is plotted against the frequency of hydrophobic residues at each position. Hydrophobicity was evaluated according to Kyte and Doolittle. (Kyte and Doolittle, J. Mol. Biol. 157: 105-132 (1982)). Leucine residues are represented by closed boxes, other hydrophobic residues by dashed boxes.

FIG. 2 provides a secondary structure analysis of the eps15 protein product deduced from the mouse cDNA. Scores were calculated with the Chou-Fassman algorithm. (Chou and Fassman, Adv. Enzymo. Relat. Areas Mol. Biol. 47: 45-148 (1978)). Turns, alpha helices, and beta sheets are graphed against amino acid number.

FIG. 3 profiles the hydropaticity of eps15 deduced from the mouse cDNA. Values were calculated according to Kyte and Doolittle (Kyte and Doolittle, supra) on a window of 10 amino acids with equal weights. Positive values indicate hydrophobicity, negative values hydrophilicity. Hydropaticity is diagrammed against amino acid number.

DETAILED DESCRIPTION OF THE INVENTION

The present invention includes the discovery of a novel epidermal growth factor (EGF) receptor (EGFR) substrate which is called eps15 (EGFR pathway substrate), together with complete cDNA and deduced protein sequences for the murine eps15 (SEQ ID NOS:3 and 4, respectively) and cDNA and deduced protein sequences for the-human eps15 (SEQ ID NOS:1 and 2, respectively). The protein sequences are referred to as "deduced" sequences simply because they were determined from the nucleotide sequence, rather than from analysis of purified natural protein.

In addition, the present invention provides methodology for isolating cDNA and protein sequences of other species; antibodies which recognize the proteins encoded by the cDNAs; expression vectors for producing eps15 in prokaryotic or eukaryotic cells; cell lines expressing eps15; and assays using the antibodies, cDNA sequences, and proteins.

The sequences falling within the scope of the present invention are not limited to the specific sequences described, but include functional fragments thereof, that is, fragments that function in substantially the same way as do the specific sequences described. Functional polynucleotide fragments having at least about 60 nucleotides, advantageously at least about 30 nucleotides, and preferably at least about 15 nucleotides are contemplated. Regarding functional polypeptide fragments, those having at least about 20 amino acid residues, advantageously at least about 10 amino acid residues, and preferably at least about 5 amino acid residues are considered. Moreover, the invention includes conservative variants of the specific proteins described, i.e., substituents incorporating conservative changes for any of the amino acid residues provided herein. Further, to accommodate codon degeneracy, the invention encompasses any DNA sequence encoding the proteins of the present invention.

The eps15 proteins, polynucleotide sequences, constructs, vectors, clones, and other materials comprising the present invention can advantageously be in isolated form. As used herein, the term "isolated" denotes that the material has been removed from its original environment (e.g., the natural environment if it is naturally occurring). For example, a naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, but the same polynucleotide or polypeptide, separated from some or all of the coexisting materials in the natural system, is isolated.

It is also advantageous that the sequences and other materials comprising the invention be in purified form. The term "purified" does not require absolute purity; rather, it is intended as a relative definition. Purification of starting material or natural material means that the concentration of the material is at least about 2, 5, 10, 100 or 1000 times its original concentration (for example), advantageously 0.01% by weight, preferably at least about 0.1% by weight. Purified preparations of about 0.5%, 1%, 5%, 10% and 20% by weight are also contemplated.

A. Overview

Two lines of evidence indicate that the EGF receptor is not very efficient at coupling with known second messenger systems. There is a low stoichiometry of tyrosine phosphorylation (˜1% of the total pools), of PLC-γ, and GAP weak induction of PIP2 breakdown, and very little phosphorylation/activation of raf or activation of PtdIns-3K by EGFR, even when overexpressed at levels of approximately 2×10⁶ receptors/cell (Fazioli et al., Mol. Cell Biol. 11: 2040-2048 (1991)). In addition, a mitogenesis-incompetent mutant of the EGFR (EGFR Δ660-667 Segatto et al., Mol. Cell Biol. 11: 3191-3202 (1991)) did not show any decreased ability to phosphorylate PLC-γ or GAP, or to induce PIP2 breakdown as compared to the wild type EGFR (Segatto et al., Mol. Cell Biol. 11: 3191-3202 (1991)). This strongly indicated the existence of alternative effector pathways for mitogenic signal transduction by EGFR.

Characterization of EGFR-activated pathways requires the identification of novel proteins that are tyrosine phosphorylated following stimulation of this receptor kinase. The present invention utilized a novel approach to the cloning of cDNAs coding for EGFR substrates, as disclosed in Fazioli et al., J. Biol. Chem. 267: 5155-5157 (1992) and Fazioli et al., Oncogene 8: 1335-1345 (1993), which are hereby incorporated by reference. The approach relies on batch purification of the entire set of proteins that are phosphorylated on tyrosine following EGFR activation and generation of antisera directed against the entire pool of purified proteins. These sera can be used to immunologically characterize various substrates or for expression screening of cDNA libraries.

B. Identification of Murine cDNA Encoding eps15

Antibodies to phosphotyrosine were used to isolate proteins that were tyrosine-phosphorylated upon EGF stimulation of NIH-3T3 murine fibroblasts overexpressing the EGFR (NIH-EGFR cells), as discussed in Example 1. A strategy was developed that allowed direct cloning of the cDNAs encoding several of these proteins. Briefly, two polyclonal sera were generated using the entire purified pool of phosphotyrosine (pTyr)-containing proteins as an immunogen (Fazioli et al., J. Biol. Chem. 267: 5155-5157 (1992)). These antibodies were used for expression screening of cDNA libraries, as reported in greater detail in Examples 1 and 2.

A novel cDNA isolated by this method was sequenced as described in Example 3, and the encoded protein was designated eps15. The nucleotide sequence of the eps15 cDNA is presented herein as SEQ ID NO:3. The deduced amino acid sequence, given herein as SEQ ID NO:4, describes a protein bearing a candidate tyrosine phosphorylation site but, surprisingly, no Src homology 2 (SH2) or Src homology 3 (SH3) domains which are the characteristic hallmarks of TKR substrates. Antibodies generated against the cDNA protein product in accordance with Example 4 recognized in NIH-EGFR cells a 142 kDa protein and a less abundant 155 kDa protein, both of which were phosphorylated on tyrosine residues following treatment of intact cells with EGF.

C. Features and Properties of eps15 Protein

The amino acid sequence deduced from the single eps15 ORF provides a 897 amino acid protein with a calculated molecular weight of approximately 98 kDa.

The features of the deduced polypeptide indicated the presence of three structural domains. Domain I (spanning between amino acid positions 9-314) consisted of three imperfect repeats of 95-97 amino acids, with a 55-60% overall degree of conservation. The second repeat included a tyrosine, at position 132, flanked by the consensus sequence for putative tyrosine phosphorylation sites (Bairoch, Nucleic Acids. Res. 20: 2013-2018 (1992); Hunter, J. Biol. Chem. 257: 4843-4848 (1982)). This tyrosine residue was conserved in the first and third repeat of Domain I, although it was not flanked by the phosphorylation consensus. The second and third repeat also contained consensus sequences for calcium-binding domains of the EF-hand type (Bairoch, Nucleic Acids. Res. 20:2013-2018 (1992); Heizmann et al., Trends Biochem. Sci. 16: 98-103 (1991)) at positions 173-185 and 236-248, respectively.

Domain II (amino acid positions 335-502) presented the typical features of an α-helical coiled-coil structure, common to several cytoskeleton-related proteins. The requisite for the formation of a coiled-coil α-helix is the presence of heptad repeats, in which the first and fourth positions usually contain hydrophobic amino acid residues (McLachlan et al., J. Mol. Biol. 164: 605-626 (1983)). Domain II of eps15 was composed of 24 contiguous heptads whose positions 1 and 4 were markedly biased in favor of apolar amino acids, in particular leucine. FIG. 1. In addition, Domain II contained only 2 glycines and 1 proline (1.8% of the total residues, as opposed to 11.8% in the entire eps15) which are both strong α-helix breakers. Secondary structure analysis (FIG. 2) indicated that this region has the potential to form an α-helix.

Domain III (amino acids 598-842) was characterized by the repetition of a three amino acid motif, aspartic acid-proline-phenylalanine (DPF repeat); thirteen perfect DPFs and eight imperfect repeats (DXF, DPX and XPF) were identified. A hydropaticity plot of eps15 (FIG. 3) did not reveal any long stretch of hydrophobic amino acids that would qualify as a transmembrane domain, nor any signal peptide.

Remarkably, no Src homology 2 (SH2) or Src homology 3 (SH3) domains, which are characteristic signatures of TKR substrates (Koch et al., Science 252: 668-674 (1991)), were identifiable. Src homology (SH) regions 2 and 3 are noncatalytic regions conserved among substrates regulated by TKRs. The SH2 domains of these substrates bind activated TKRs and other tyrosine phosphorylated proteins, thus suggesting a role for SH2 in the mediation of protein-protein interactions. SH3 sequences are implicated in acting together with SH2 to modulate interactions with the cytoskeleton and membrane. The lack of SH2 and SH3 domains suggests that eps15 exploits other novel mechanisms for regulating protein-protein interactions during signal transduction.

D. Obtaining of Human cDNA for eps15

The present invention includes partial and complete human cDNA sequences and human genomic DNA sequences for eps15. A partial human eps15 sequence was obtained by PCR amplification from a human cDNA library using short sequences of the mouse eps15 cDNA as primers. This procedure is explained in more detail in Example 5.

A resultant PCR product containing partial sequence for human eps15 was used as a probe to identify a human cDNA clone potentially corresponding to a full-length transcript. See Example 6. The cDNA sequence for this human eps15 clone is set forth herein as SEQ ID NO:1. The deduced protein sequence of the human eps15 was 896 amino acids in length and included the entire open reading frame corresponding to the mouse eps15 cDNA. The human amino acid sequence displayed 90% identity to the mouse sequence. The peptide sequence for human eps15 is included herein as SEQ ID NO:2.

Although the described human eps15 cDNA clone may not be full length, such a clone can be obtained using well known methodology. For example, antibodies against the expression product of the human cDNA sequence of SEQ ID NO:1 can be used for expression screening of a cDNA library. These experiments can employ the techniques set forth in Example 4 to generate polyclonal antibodies against human eps15, and then candidate clones can be identified as set forth in Example 2 and characterized by sequencing as set forth in Example 3.

Additionally, conventional biochemical techniques permit use of a partial or complete cDNA clone as a probe to identify a cDNA corresponding to a full-length transcript or a genomic clone having the complete eps15 gene, including regulatory and promoter regions, exons, and introns.

One general approach for obtaining a complete cDNA sequence or genomic DNA sequence corresponding to the human eps15 gene is as follows:

1. Label a human eps15 cDNA and use it as a probe to screen a human lambda phage cDNA library or a human plasmid cDNA library.

2. Identify colonies containing clones related to the probe cDNA and purify them by known purification methods.

3. Nucleotide sequence the ends of the newly purified clones to identify full length sequences.

4. Perform complete nucleotide sequencing of putative full length clones by Exonuclease III digestion or primer walking using art-known means. Northern blots of mRNA from various tissues using at least part of the clone as a probe can be performed to check the size of the mRNA against that of the purported full length cDNA.

More particularly, all or part of the DNA sequence of SEQ ID NO:1 may be used as a probe to identify cDNA clones containing the full length cDNA sequence. The partial sequence of SEQ ID NO:1, or portions thereof, can be nick-translated or end-labelled with ³² P using polynucleotide kinase and labelling methods known to those with skill in the art (Basic Methods in Molecular Biology, L. G. Davis, M. D. Dibner, and J. F. Battey, ed., Elsevier Press, N.Y. 1986). A lambda library can be directly screened with the labelled cDNA probe, or the library can be converted en masse to pBluescript® (Stratagene, La Jolla, Calif.) to facilitate bacterial colony screening. Both methods are well known in the art.

Briefly, filters with bacterial colonies containing the library in pBluescript® or bacterial lawns containing lambda plaques are denatured and the DNA is fixed to the filters. The filters are hybridized with the labelled probe using hybridization conditions described by Davis et al. All or part of SEQ ID NO:1, cloned into lambda or pBluescript®, can be used as a positive control to assess background binding and to adjust the hybridization and washing stringencies necessary for accurate clone identification. The resulting autoradiograms are compared to duplicate plates of colonies or plaques, where each exposed spot corresponds to a positive colony or plaque. The colonies or plaques are selected, expanded, and the DNA is isolated from the colonies for further analysis and sequencing.

Positive cDNA clones in phage lambda may be analyzed to determine the amount of additional sequence they contain using PCR with one primer from the sequence obtained from SEQ ID NO:1 and the other primer from the vector. Clones with a larger vector-insert PCR product than the original clone are analyzed by restriction digestion and DNA sequencing to determine whether they contain an insert of the same size or similar as the mRNA size on a Northern blot.

Once one or more overlapping cDNA clones are identified, the complete sequence of the Clones can be determined. The preferred method is to use Exonuclease III digestion (McCombie et al., Methods 3: 33-40 (1991)). A series of deletion clones is generated, each of which is sequenced. The resulting overlapping sequences are assembled into a single contiguous sequence of high redundancy (usually three to five overlapping sequences at each nucleotide position), resulting in a highly accurate final sequence.

A similar screening and clone selection approach can be applied to obtaining cosmid or lambda clones from a genomic DNA library (Kirkness et al., Genomics 10: 985-995 (1991)). Although the process is much more laborious, these genomic clones can also be sequenced in their entirety. A shotgun approach is preferred to sequencing clones with inserts longer than 10 kb (genomic cosmid and lambda clones). In shotgun sequencing, the clone is randomly broken into many small pieces, each of which is partially sequenced. The sequence fragments are then aligned via computer to produce a final contiguous sequence with high redundancy. An intermediate approach to obtaining genomic DNA sequence is to sequence just the promoter region and the intron-exon boundaries and to estimate the size of the introns by restriction endonuclease digestion.

E. Expression of eps15 Gene

Following isolation and characterization of the eps15 gene, it is routine to express that gene in a recombinant organism to obtain significant amounts of eps15. One example of a suitable expression vector and host is set forth in Example 4. Alternatively, the DNA encoding eps15 can be inserted into other conventional host organisms and expressed. The organism can be a bacterium, yeast, cell line, or multicellular plant or animal. The literature is replete with examples of suitable host organisms and expression techniques. For example, naked polynucleotide (DNA or mRNA) can be injected directly into muscle tissue of mammals, wherein it is expressed. This methodology can be used to deliver the polynucleotide and, therefore, the resulting polypeptide translation product to the animal, or to generate an immune response against a foreign polypeptide (Wolff et al., Science 247: 1465 (1990); Felgner et al., Nature 349: 351 (1991)). Alternatively, the coding sequence, together with appropriate regulatory regions (i.e., a construct), can be inserted into a vector, which is then transfected into a cell. The cell (which may or may not be part of a larger organism) then expresses the polypeptide.

F. In vivo Transcription of eps15

In order to assess the expression of mRNA encoded by the murine eps15 gene, we performed Northern blot analysis of poly(A)+RNA extracted from NIH-3T3 cells using the pl 15 insert as a probe. Two major bands of ˜3.3 and ˜6.0 kb were detected. The size of the smaller band was in agreement with that of the eps15 cDNA clone. The nature of the 6.0 kb band was not resolved. It is unlikely that the band represents a related species since hybridization was performed under high stringency conditions. Thus, it most likely represents a partially processed precursor or an alternatively spliced form of the transcript.

G. Assays for Detecting eps15

Antibodies generated against the eps15 polypeptide can be obtained by direct injection of the naked polynucleotide into an animal (Wolff et al., Science 247: 1465 (1990)) or by administering the polypeptide to an animal, as explained in Example 4. The antibody so obtained will then bind the polypeptide itself. In this manner, even a sequence encoding only a fragment of eps15 can be used to generate antibodies binding the entire native polypeptide.

Antibodies generated in accordance with Example 4 can be used in standard immunoassay formats to detect the presence and/or amount of eps15 in a sample. Such assays can comprise competitive or noncompetitive assays. Radioimmunoassays, ELISAs, Western Blot assays, immunohistochemical assays, immunochromatographic assays, and other conventional assays are expressly contemplated. Furthermore, polyclonal antibodies against human or other eps15 can be readily generated using the techniques of Example 4, and monoclonal antibodies to any form of eps15 can be generated using well-known methods. All of these antibodies can be used in assays of the present invention. The assays and the antibodies discussed herein form an embodiment of this invention.

H. Detection of TKR Kinase Activity

In many instances, it is important to know the TKR tyrosine kinase activity of a sample. The present invention provides a ready method by which such activity can be determined. As discussed in more detail infra, eps15 is tyrosine phosphorylated by the EGFR as well as by other tyrosine kinase receptors. This ability of TKRs to phosphorylate eps15 is exploited to measure the presence of TKRs in a sample.

Briefly, one method for such measurement is to contact a sample with eps15, add radiolabeled γ-ATP to the sample and then measure the extent to which the radiolabel is incorporated into the eps15. Anti-eps15 antibodies may be used in the final step of the assay to capture eps15 for measurement of phosphorylation. Such assays are disclosed in more detail in Examples 9 and 10.

I. Differential Phosphorylation of eps15 by the EGFR and

erbB-2 Kinase

As discussed in more detail herein, eps15 is tyrosine phosphorylated by the EGFR kinase and much less efficiently by the highly related gp185^(erbB-2) kinase, an unexpected finding in light of the high degree of homology between these two kinases.

The availability of anti-eps15 antibodies permitted testing of the specificity of eps15 phosphorylation by the EGFR and erbB-2 kinases. Tyrosine phosphorylation of the major species of the eps15 gene product, p142^(eps15), was measured. Two mass populations of NIH-3T3 cells transfected either with EGFR or with the EGFR/erbB-2 chimera containing the extracellular domain of EGFR and the intracellular domain of erbB-2 were utilized (Fazioli et al., Mol. Cell. Biol. 11: 2040-2048 (1991); Lee et al., EMBO J. 8: 167-173 (1989); Lehvaslaiho et al., EMBO J. 8: 159-166 (1989); Lonardo et al., New Biol. 2: 992-1003 (1990)). The NIH-EGFR and NIH-EGFR/erbB-2 lines expressed comparable number of receptors and exhibited similar affinity for EGF binding, thus allowing rigorous quantitative analysis of the in vitro phosphorylation events triggered by EGF stimulation.

EGF treatment of NIH-EGFR/erbB-2 cells resulted in little if any tyrosine phosphorylation of p142^(eps15), although it induced readily detectable autophosphorylation of the chimeric receptor and phosphorylation of a number of other intracellular proteins. Conversely, EGF stimulation of NIH-EGFR promptly triggered p142^(eps15) tyrosine phosphorylation. The differential response of NIH-EGFR and NIH-EGFR/erbB-2 could not be ascribed to intrinsic differences in the two cell lines, since PDGF stimulation was able to elicit p142^(eps15) phosphorylation in both of them. It was concluded, therefore, that the erbB-2 kinase is much less efficient than the EGFR at stimulating tyrosine phosphorylation of p142^(eps15).

A panel of chimeric molecules constructed from regions of the EGFR and erbB-2 kinase was employed to map the portion(s) of these receptors responsible for the differential tyrosine phosphorylation of eps15 by EGFR and erbB-2. The chimeras were engineered by substituting domains of gp185^(erbB-2) for the corresponding regions of the EGFR and have been previously described and characterized (Di Fiore et al., Cell 51: 1063-1070 (1987); Di Fiore et al., Mol. Cell. Biol. 10: 2749-2756 (1990); Di Fiore et al., Science 248: 79-83 (1990); Segatto et al., Mol. Cell. Biol. 11: 3191-3202 (1991)). An EGFR/erbB-2^(COOH) chimera, in which the carboxyl terminal (COOH) domain of gp185^(erbB-2) was substituted for the COOH domain of EGFR, was able to phosphorylate eps15 comparably to the wild type EGFR. Conversely, the substitution of the tyrosine kinase (TK) region of gp185^(erbB-2) into EGFR (EGFR/erbB-2^(TK)) yielded a molecule which was severely impaired in its ability to phosphorylate eps15. Further mapping showed that the substitution of the first .sup.˜ 150 amino acid of gp185^(erbB-2) for the analogous region of EGFR (juxtamembrane or TK-1 domain) was alone sufficient to decrease the ability of the chimeric EGFR/erbB-2^(TK1) receptor to phosphorylate eps15.

These differences were not due to different levels of expression of the chimeric molecules and wild type EGFR in NIH-3T3 transfectants, as shown by ¹²⁵ I-EGF binding experiments, nor to differences in the levels of eps15 expressed in the various transfectants. In addition, it has previously been shown that all of the employed chimerae possess in vivo tyrosine kinase activity as demonstrated by efficient autophosphorylation and competence at phosphorylating intracellular substrates, including PLC-γ (Fazioli et al., Mol. Cell. Biol. 11: 2040-2048 (1991); Segatto et al., Mol. Cell. Biol. 11: 3191-3202 (1991); Di Fiore et al., Mol. Cell. Biol. 10: 2749-2756 (1990)), and ability to deliver sizable mitogenic signals in NIH-3T3 cells (Di Fiore et al., Mol. Cell. Biol. 10: 2749-2756 (1990); Segatto et al., Mol. Cell. Biol. 11: 3191-3202 (1991); Fazioli et al., Mol. Cell. Biol. 11: 2040-2048 (1991); Di Fiore et al., Science 248: 79-83 (1990)). It was concluded, therefore, that the juxtamembrane (or TK-1) region of the EGFR contains the determinants responsible for efficient phosphorylation of eps15.

The mechanisms by which the EGFR phosphorylates the eps15 gene product remain to be established. In repeated experiments, co-immunoprecipitation of these two proteins was not detected. This indicates either phosphorylation by a second kinase, activated by the EGFR, or disruption of the EGFR/eps15 complex even under the mild lysis conditions employed in the co-immunoprecipitation experiments. It is widely accepted now that detergent-resistant interactions between TKRs and some of their substrates are made possible by binding of specialized regions of substrate molecules, known as SH2 domains, to phosphotyrosine motifs present in TKRs (Koch et al., Science 252: 668-674 (1991)). To this regard, it is noted that the eps15 cDNA describes the synthesis of a protein lacking SH2 domains. The SH2/pTyr interactions, however, may not be the sole ones responsible for receptor/substrate interactions, and others, possibly of a weaker type, might exist. There is now evidence that purified EGFR can directly phosphorylate bacterially expressed p142^(eps15) in vitro assays (W. Wong and P. P. Di Fiore, unpublished observations).

J. Detection of Altered Mitogenic Signal Transduction

The eps15 of the present invention is also valuable in detection of altered mitogenic signal transduction. Such altered signal transduction can be ascertained by measurement of eps15 levels in vivo or in vitro using the immunoassays discussed above. Alternatively, altered forms of eps15 can be detected by using at least a portion of the DNA encoding eps15 as a probe to isolate the DNA encoding a possibly altered form of eps15. Techniques of the type disclosed in Examples 2 and 3, or other conventional techniques, can then be used to sequence the isolated DNA. By comparing this sequence to the known sequence, alterations can be detected.

If an altered eps15 sequence or abnormal levels of eps15 are detected in malignant tissue, antisense therapy can be utilized in accordance with Example 11 to halt translation of the protein, or triple helix therapy in accordance with Example 12 to shut-off RNA transcription, and, thus, in both cases, to interfere with mitogenesis.

K. Increasing the Mitogenic Response of Cells

We have also discovered that the mitogenic response of cells to mitogenic factors can be enhanced by delivering eps15 to the cell in amounts greater than the natural amounts. The optimum dosage for any particular cell type can be empirically determined in a relatively straightforward manner. It is apparent that an increased dosage will have a mitogenesis-enhancing effect as demonstrated by the observation that overexpression of eps15 is sufficient to transform NIH-3T3 cells.

Particular aspects of the invention may be more readily understood by reference to the following examples, which are intended to exemplify the invention, without limiting its scope to the particular exemplified embodiments.

EXAMPLE 1

Generation of Polyclonal Antibody Against EGFR Substrates

Immunoaffinity chromatography techniques were used to isolate proteins which were tyrosine phosphorylated by EGFR, as described by Fazioli et al., J. Biol. Chem. 267: 5155-5161 (1992). Briefly, genetically engineered NIH-3T3 cells which overexpress EGFR (NIH-EGFR) (Fazioli et al., Mol. Cell Biol. 11: 2040-2048 (1991)) were maintained in DMEM (Gibco, Gaithersburg, Md.) supplemented with 10% calf serum (Gibco, supra). Subconfluent cell monolayers were treated with EGF (Upstate Biotechnology, Inc. (UBI), Lake Placid, N.Y.), and lysed. EGFR was removed from the lysate using an anti-EGFR column prepared by linking anti-EGFR monoclonal antibody (Ab1, Oncogene Science, Uniondale, N.Y.) to agarose beads. The lysate was then contacted with an anti-phosphotyrosine (anti-pTyr, Oncogene Science, supra) column; the column was washed; and the bound protein was then eluted. Fractions were collected and were used to immunize two New Zealand white rabbits, yielding two polyclonal immune sera, designated 450 and 451.

EXAMPLE 2

Identification of eps15 cDNA Clone

A pool of sera 450 and 451 from Example 1 was used to screen a commercial (Clontech, Palo Alto, Calif.) λgt11 library from NIH-3T3 cells. Recombinant plaques (10⁶) were initially screened with a 1:200 dilution of each antibody in TTBS (0.05% Tween 20 mM Tris-HCl pH 7.5! 150 mM NaCl) containing 1% BSA. Detection was carried out with a goat anti-rabbit Ab conjugated to alkaline phosphatase by utilizing a commercial kit (Picoblue, Stratagene, La Jolla, Calif.) according to the manufacturer's specification. Analysis yielded several positive plaques;. one of these clones (pl 15) contained an insert of ˜1.8 kbp which was completely sequenced and had no correspondence to sequences present in the Genbank or EMBL data banks. The pl 15 insert was subcloned in the Eco RI site of pBluescript® (Stratagene, supra).

The sequence of pl 15 predicted an ORF which started in the expected frame with the β-galactosidase portion of λgt11 but contained neither an initiation nor a stop codon. It was concluded that pl 15 represented a partial cDNA encoding a novel protein, now designated eps15 (for EGFR pathway substrate #15).

EXAMPLE 3

Isolation and Sequencing of eps15 cDNA

Full length cDNA for eps15 (pCEV-eps15) was obtained by screening a mouse keratinocyte cDNA library (Miki et al., Science 251: 72-75 (1991)) using the pl 15 insert from Example 2 as a probe according to standard procedures (Sambrook et al., Molecular Biology: A Laboratory Manual (Cold Spring Harbor Laboratory Press 1989). DNA sequencing was performed by the dideoxy-termination method on both strands of the cDNA, using a commercial kit (SEQUENASE®, United States Biochemical, Cleveland, Ohio). The resulting DNA sequence is identified as SEQ ID NO:3. The 3033 bp sequence was found to contain a stop codon at position 2802-2804 followed by a 3' untranslated sequence containing a putative polyadenylation site (AATTAAA) starting at position 3014. The first in-frame ATG (position 111-113) conformed to Kozak's rules for translational initiation (Kozak M., J. Cell Biol. 108: 229-241 (1989)) and was preceded by 110 bp of 5' untranslated sequence.

EXAMPLE 4

Preparation of Anti-eps15 Antibody and Expression of eps15 Gene Product

Polyclonal antibodies specific for the eps15 gene product were generated against a recombinant trpE fusion protein. To this end the open reading frame (ORF) of pl 15, positioned between two Eco RI sites, was cloned in frame in the Eco RI site of the pATH 11 bacterial expression vector. The recombinant fusion protein was expressed by induction with indoleacrylic acid (Koerner et al., Methods Enzymol. 194: 477-490 (1991)), gel purified and used to immunize New Zealand rabbits. A commercially available anti-phosphotyrosine (anti-pTyr) monoclonal antibody (Upstate Biotechnology, Lake Placid, N.Y.) was also used. Specificity of detection for anti-pTyr was controlled as described previously (Fazioli et al., J. Biol. Chem. 267: 5155-5161 (1992); Fazioli et al., Mol. Cell Biol. 11: 2040-2048 (1991), which are both incorporated by this reference).

The anti-eps15 serum specifically recognized a major species of Mr 142 kDa (p142^(eps15)) and a minor component of 155 kDa (p155^(eps15)) in NIH-EGFR cells. A second doublet of much lower intensity, migrating as 119-122 kDa, was also specifically recognized by the anti-eps15 serum. The size of the major eps15 band (142 kDa) was significantly larger than that of the deduced protein (approximately 98 kDa). This difference is not due to N-linked glycosylation, since no differences in the electrophoretic mobility of eps15 were detectable after tunycamicin treatment. In addition, in vivo transcription and translation of the pCEV-eps15 cDNA yielded a protein comigrating with authentic p142^(eps15). Therefore, the discrepancy between the actual and deduced size of eps15 should be ascribed to either abnormalities in its gel migration, or to-post-translational modification still occurring in a reticulocytes lysate. Analysis of sequential immunoprecipitation and immunoblotting with anti-pTyr and anti-eps15 antibodies of lysates prepared from NIH-EGFR cells, before and after in vitro EGF treatment, indicated that both p142^(eps15) and p155^(eps15) were phosphorylated in vitro on tyrosine following EGFR activation.

Anti-pTyr recovery of the eps15 product might be due to direct recognition of phosphotyrosil residues or to association with other pTyr-containing proteins. To distinguish between these possibilities immunoprecipitation experiments with anti-eps15 were performed, followed by immunoblot with anti-pTyr. It was found that p142^(eps15) was readily detectable under these conditions in cell lysates obtained from NIH-EGFR cells triggered with EGF; p142^(eps15) was not detectable in cell lysates from untreated cells. Under these same conditions, it was not easy to detect p155^(eps15) in EGF-treated NIH-EGFR cells due to the presence of a superimposed background band in experiments on cell lysates from untreated cells. Nevertheless, these results established that eps15 is tyrosine phosphorylated following EGFR activation.

EXAMPLE 5

Derivation of Partial Human eps15 Sequence

The partial human eps15 sequence was obtained by the polymerase chain reaction (PCR) method using two oligonucleotides from the mouse eps15 cDNA sequence as primers to amplify the human cDNA fragment from a human cDNA library. The library used was from A101D cells (human melanoma cells, Giard et al., J. Nat'l Cancer Institute 51: 1417-1423 (1973)), although other readily-available libraries could be used. The library was prepared using the method of Miki et al., Science 251: 72-75 (1991). The two oligonucleotide PCR primers included sequences corresponding to positions 2137-2157 and 2618-2638 of the mouse eps15 cDNA sequence (SEQ ID NO:3), respectively. A typical PCR contained 100 ng of cDNA library DNA, 5 units of Taq DNA polymerase (Boehringer Mannheim, Indianapolis, Ind.), 1 μM of each oligonucleotide primer, 200 μM dNTPs, 10 mM Tris-HCl (pH 8.3), 1.5 mM MgCl₂, 50 mM KCl, 0.1 mg/ml gelatin. Reactions were carried out for 35 cycles of 1.5 min at 94° C., 5 min at 50° C., and 2 min at 72° C. Reactions were then terminated with an additional 10 min at 72° C. The PCR products were purified by chrome spin+TE-400 column (Clontech, Palo Alto, Calif.), subcloned into pBluescript® II KS vector (Stratagene, La Jolla, Calif.), and sequenced by the dideoxy-termination methods on both strands using the SEQUENASE® DNA sequencing kit (USB, Cleveland, Ohio).

EXAMPLE 6

Identification of Human eps15 cDNA clone

A PCR-amplified fragment of 519 bp, acquired in accordance with Example 5, was used to screen a human cDNA library from M426 cells, although other readily available libraries could be used. Several clones were isolated and the longest cDNA (SEQ ID NO:1) was DNA- sequenced by the dideoxy-termination method, as explained in Example 3. The human eps15. clone obtained was 4165 nucleotides long and possessed an ATG at positions 21-23 which conformed to Kozak's rules for translation initiation (Kozak et al., J. Cell. Biol. 108: 229-241 (1989)). The clone displayed a stop codon at position 2709-2711. It was not determined whether this was a full length clone, but it encompassed the entire open reading frame of eps15, as established by comparison to the mouse cDNA.

EXAMPLE 7

Isolation of eps15 Sequences from other Organisms

Two potentially complementary strategies are used to isolate and clone the eps15 gene in other species. The first strategy is essentially the same as the one used to obtain the human eps15 sequence. Briefly, two oligonucleotides from the mouse or the human eps15 sequence can be used to PCR amplify fragments of eps15 cDNA from other species. The oligonucleotides can be designed from regions of high nucleotide identity between the human and mouse sequence, in a way to increase the probability of obtaining an efficient matching of the primers with the eps15 sequences of other species. Alternatively, degenerate PCR primers can be used to amplify sequences similar to the mouse or human gene. The template for the PCR reaction can be a cDNA library from cells of another species or a cDNA obtained by reverse transcriptase (in the so called reverse transcriptase/PCR method) directly from the mRNA of another species. A second approach relies on classical low stringency hybridization of nucleic acids. In this case a probe representing the eps15 cDNA from human or mouse is hybridized, under relaxed conditions of stringency, against libraries (cDNA or genomic) prepared from cells of other species. Relaxed stringency is obtained by modifying the temperature and the ionic strength of the hybridization buffer by well known methods, in a manner designed to allow stable formation of hybrids which are not 100% matching (as expected in inter-species hybridization). The positives are then analyzed as described above (see Example 6). A complete review on low stringency hybridization is to be found in Kraus et al., Methods in Enzymology 200: 546-556 (1991), which is hereby incorporated by reference.

EXAMPLE 8

Quantitative Immunoassay for eps15

It is often desirable to determine the quantity of eps15 in a sample. This can be particularly useful in clinical research, as well as in detecting abnormalities in mitogenic signal transduction in malignant tissue. Additionally, in many human tumors, tumor markers are released in the blood stream at levels which correlate with the size of the tumor and its clinical stage. Determining the levels of markers (such as eps15) in biological fluids can be advantageous in aiding the diagnostic procedures and in monitoring the effectiveness of therapy.

In one exemplary technique, anti-eps15 antibody from Example 4is immobilized to an agarose column, as explained in Example 1. Sample is then directed through the column where eps15 in the sample is bound by the immobilized antibody. Next, a known quantity of radiolabeled anti-eps15 antibody is directed through the column. The quantity of labeled antibody which is not retained on the column is measured, and bears a relationship to the quantity of eps15 in the sample.

Another exemplary technique is liquid phase radioimmunoassay. First, a standard measurement is made by challenging a known amount of purified eps15, radiolabeled in a conventional manner, against a known amount of anti-eps15 antibody. The resulting immunocomplex is recovered by centrifugation, and the radioactivity of the centrifugate is determined. This value is used as a standard against which later measurements are compared.

Next, a sample, containing an unknown amount of eps15, is challenged against the same known amount (used in making the standard measurement) of anti-eps15 antibody. Then, the same amount of labeled eps15 used in making the standard measurement is added to the reaction mixture, followed by centrifugation and measurement of radioactivity as explained above. The decrease in the immunoprecipitated radioactivity (in comparison to the standard) is proportional to the amount of eps15 in the sample.

Of course, in addition to the foregoing exemplary methods, any of the well known conventional immunoassay methods may similarly be used.

EXAMPLE 9

Assay for Phosphorylation of eps15 by TKRs

A biological sample is assayed for TKR tyrosine kinase activity by combining the sample with known quantities of eps15 and ³² P-labeled γ-ATP. The sample is then contacted with anti-eps15 from Example 4 immobilized on a column; the column is washed; and the bound eps15 is eluted with 0.1M glycine, pH 2.5. The eluant is then subjected to fractionation to separate the resulting radiolabeled eps15 from the free radioactivity in the sample using any conventional technique, such as precipitation in 5-10% trichloroacetic acid. Following fractionation, the amount of radioactivity incorporated into the eps15 is counted to measure TKR tyrosine kinase activity of the sample.

EXAMPLE 10

Alternative Assay for TKR Tyrosine Kinase Activity

100 ng of eps15 is added to 1 ml buffered cell lysate suspected of having TKR tyrosine kinase activity, together with 30 μC ³² P-γATP. Following incubation, the mixture is heated to 100° C. in a solution containing sodium lauryl sulfate (SDS) and β-mercaptoethanol. Aliquots are electrophoresed on 10-15% gradient SDS polyacrylamide gels and exposed to X-Omat X-ray film to identify radioactive eps15. Cell lysate from eps15-transfected cells incubated in the presence of radiolabeled amino acids is used to confirm the location on the gel of the phosphorylated eps15.

EXAMPLE 11

Preparation and Use of Antisense Oligonucleotides

Antisense RNA molecules are known to be useful for regulating translation within the cell. Antisense RNA molecules can be produced from the sequences of the present invention. These antisense molecules can be used as therapeutic agents to regulate gene expression.

The antisense molecules are obtained from a nucleotide sequence by reversing the orientation of the coding region with regard to the promoter. Thus, the antisense RNA is complementary to the corresponding mRNA. For a review of antisense design see Green et al., Ann. Rev Biochem. 55: 569-597 (1986), which is hereby incorporated by reference. The antisense sequences can contain modified sugar phosphate backbones to increase stability and make them less sensitive to RNase activity. Examples of the modifications are described by Rossi et al., Pharmocol. Ther. 50: 245-254, (1991).

Antisense molecules are introduced into cells that express the eps15 gene. In a preferred application of this invention, the effectiveness of antisense inhibition on translation can be monitored using techniques that include, but are not limited to, antibody-mediated tests such as RIAs and ELISA, functional assays, or radiolabeling. The antisense molecule is introduced into the cells by diffusion or by transfection procedures known in the art. The molecules are introduced onto cell samples at a number of different concentrations, preferably between 1×10⁻¹⁰ M to 1×10⁻⁴ M. Once the minimum concentration that can adequately control translation is identified, the optimized dose is translated into a dosage suitable for use in vivo. For example, an inhibiting concentration in culture of 1×10⁻⁷ M translates into a dose of approximately 0.6 mg/kg bodyweight. Levels of oligonucleotide approaching 100 mg/kg bodyweight or higher may be possible after testing the toxicity of the oligonucleotide in laboratory animals.

The antisense molecule can be introduced into the body as a bare or naked oligonucleotide, oligonucleotide encapsulated in lipid, oligonucleotide sequence encapsidated by viral protein, or as oligonucleotide contained in an expression vector. The antisense oligonucleotide is preferably introduced into the vertebrate by injection. Alternatively, cells from the vertebrate are removed, treated with the antisense oligonucleotide, and reintroduced into the vertebrate. It is further contemplated that the antisense oligonucleotide sequence is incorporated into a ribozyme sequence to enable the antisense to bind and cleave its target. For technical applications of ribozyme and antisense oligonucleotides, see Rossi et al., supra.

EXAMPLE 12

Preparation and Use of Triple Helix Probes

Triple helix oligonucleotides are used to inhibit transcription from a genome. They are particularly useful for studying alterations in cell activity as it is associated with a particular gene. The eps15 sequence or, more preferably, a portion thereof, can be used to inhibit gene expression in individuals suffering from disorders associated with the eps15 gene. Similarly, a portion of the eps15 gene sequence, or the entirety thereof, can be used to study the effect of inhibiting transcription of the gene within a cell. Traditionally, homopurine sequences were considered the most useful. However, homopyrimidine sequences can also inhibit gene expression. Thus, both types of sequences corresponding to the eps15 gene are contemplated within the scope of this invention. Homopyrimidine oligonucleotides bind to the major groove at homopurine:homopyrimidine sequences. As an example, 10-mer to 20-mer homopyrimidine sequences from the eps15 gene can be used to inhibit expression from homopurine sequences. Moreover the natural (beta) anomers of the oligonucleotide units can be replaced with alpha anomers to render the oligonucleotide more resistant to nucleases. Further, an intercalating agent such as ethidium bromide, or the like, can be attached to the 3' end of the alpha oligonucleotide to stabilize the triple helix. For information on the generation of oligonucleotides suitable for triple helix formation see Griffin et al., Science 245: 967-971 (1989), which is hereby incorporated by this reference.

The oligonucleotides may be prepared on an oligonucleotide synthesizer or they may be purchased commercially from a company specializing in custom oligonucleotide synthesis. The sequences are introduced into cells in culture using techniques known in the art that include but are not limited to calcium phosphate precipitation, DEAE-Dextran, electroporation, liposome-mediated transfection or native uptake. Treated cells are monitored for altered cell function. Alternatively, cells from the organism are extracted, treated with the triple helix oligonucleotide, and reimplanted into the organism.

While particular embodiments of the invention have been described in detail, it will be apparent to those skilled in the art that these embodiments are exemplary rather than limiting, and the true scope of the invention is that defined within the attached claims.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 4                                                   (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 4165 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 21..2709                                                         (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        AACCGCATGATGGAAACACCATGGCTGCGGCGGCCCAGCTCTCTCTGACA50                           MetAlaAlaAlaAlaGlnLeuSerLeuThr                                                 1510                                                                           CAGTTATCAAGTGGGAATCCTGTATATGAAAAATACTATAGACAGGTT98                             GlnLeuSerSerGlyAsnProValTyrGluLysTyrTyrArgGlnVal                               152025                                                                         GATACAGGCAATACTGGAAGGGTGTTGGCTTCTGATGCTGCTGCTTTC146                            AspThrGlyAsnThrGlyArgValLeuAlaSerAspAlaAlaAlaPhe                               303540                                                                         CTGAAAAAATCAGGGCTTCCAGACTTGATACTTGGAAAGATTTGGGAT194                            LeuLysLysSerGlyLeuProAspLeuIleLeuGlyLysIleTrpAsp                               455055                                                                         TTAGCCGACACAGATGGCAAAGGTATCCTGAACAAACAAGAATTCTTT242                            LeuAlaAspThrAspGlyLysGlyIleLeuAsnLysGlnGluPhePhe                               606570                                                                         GTTGCTTTGCGTCTTGTGGCATGTGCCCAGAATGGATTGGAAGTTTCA290                            ValAlaLeuArgLeuValAlaCysAlaGlnAsnGlyLeuGluValSer                               75808590                                                                       CTAAGTAGTTTGAACCTGGCTGTTCCTCCACCAAGATTTCATGATACC338                            LeuSerSerLeuAsnLeuAlaValProProProArgPheHisAspThr                               95100105                                                                       AGTAGTCCTTTGCTAATCAGTGGAACCTCTGCAGCTGAGCTCCCATGG386                            SerSerProLeuLeuIleSerGlyThrSerAlaAlaGluLeuProTrp                               110115120                                                                      GCTGTAAAACCTGAAGATAAGGCCAAATATGATGCAATATTTGATAGT434                            AlaValLysProGluAspLysAlaLysTyrAspAlaIlePheAspSer                               125130135                                                                      TTAAGCCCAGTGAATGGATTTCTGTCTGGTGATAAAGTGAAACCAGTG482                            LeuSerProValAsnGlyPheLeuSerGlyAspLysValLysProVal                               140145150                                                                      TTGCTCAACTCTAAGTTACCTGTGGATATCCTTGGAAGAGTTTGGGAG530                            LeuLeuAsnSerLysLeuProValAspIleLeuGlyArgValTrpGlu                               155160165170                                                                   TTGAGTGATATTGACCATGATGGAATGCTTGACAGAGATGAGTTTGCA578                            LeuSerAspIleAspHisAspGlyMetLeuAspArgAspGluPheAla                               175180185                                                                      GTTGCCATGTTTTTGGTATACTGTGCACTGGAGAAAGAACCTGTGCCA626                            ValAlaMetPheLeuValTyrCysAlaLeuGluLysGluProValPro                               190195200                                                                      ATGTCCTTGCCTCCAGCCTTGGTGCCACCATCTAAGAGAAAAACGTGG674                            MetSerLeuProProAlaLeuValProProSerLysArgLysThrTrp                               205210215                                                                      GTTGTATCCCCTGCAGAAAAAGCTAAATATGATGAAATCTTCCTGAAA722                            ValValSerProAlaGluLysAlaLysTyrAspGluIlePheLeuLys                               220225230                                                                      ACTGATAAAGATATGGACGGATTTGTGTCTGGATTGGAGGTCCGTGAA770                            ThrAspLysAspMetAspGlyPheValSerGlyLeuGluValArgGlu                               235240245250                                                                   ATATTCTTGAAAACAGGTTTACCTTCTACCTTACTAGCCCATATATGG818                            IlePheLeuLysThrGlyLeuProSerThrLeuLeuAlaHisIleTrp                               255260265                                                                      TCATTATGCGACACAAAGGACTGTGGGAAGCTTTCAAAGGATCAGTTT866                            SerLeuCysAspThrLysAspCysGlyLysLeuSerLysAspGlnPhe                               270275280                                                                      GCCTTGGCTTTTCACTTAATCAGTCAGAAGTTAATCAAGGGCATTGAT914                            AlaLeuAlaPheHisLeuIleSerGlnLysLeuIleLysGlyIleAsp                               285290295                                                                      CCTCCTCACGTTCTTACTCCTGAAATGATTCCACCATCAGACAGGGCC962                            ProProHisValLeuThrProGluMetIleProProSerAspArgAla                               300305310                                                                      AGTTTACAAAAGAACATCATAGGATCAAGTCCTGTTGCAGATTTCTCT1010                           SerLeuGlnLysAsnIleIleGlySerSerProValAlaAspPheSer                               315320325330                                                                   GCTATTAAGGAACTAGATACTCTTAACAATGAAATAGTTGACCTACAG1058                           AlaIleLysGluLeuAspThrLeuAsnAsnGluIleValAspLeuGln                               335340345                                                                      AGGGAAAAGAATAATGTGGAACAGGACCTTAAGGAGAAGGAAGATACT1106                           ArgGluLysAsnAsnValGluGlnAspLeuLysGluLysGluAspThr                               350355360                                                                      ATTAAACAGAGGACAAGTGAGGTTCAGGATCTTCAAGATGAAGTTCAA1154                           IleLysGlnArgThrSerGluValGlnAspLeuGlnAspGluValGln                               365370375                                                                      AGGGAGAATACTAATCTGCAAAAACTACAGGCCCAGAAACAGCAGGTA1202                           ArgGluAsnThrAsnLeuGlnLysLeuGlnAlaGlnLysGlnGlnVal                               380385390                                                                      CAGGAACTCCTTGATGAACTGGATGAGCAGAAAGCCCAGCTGGAGGAG1250                           GlnGluLeuLeuAspGluLeuAspGluGlnLysAlaGlnLeuGluGlu                               395400405410                                                                   CAACTCAAGGAAGTCAGAAAGAAATGTGCTGAGGAGGCCCAACTGATC1298                           GlnLeuLysGluValArgLysLysCysAlaGluGluAlaGlnLeuIle                               415420425                                                                      TCTTCTCTGAAAGCTGAATTAACTAGTCAGGAATCGCAGATCTCCACT1346                           SerSerLeuLysAlaGluLeuThrSerGlnGluSerGlnIleSerThr                               430435440                                                                      TATGAAGAAGAATTGGCAAAAGCTAGAGAAGAGCTGAGCCGTCTACAG1394                           TyrGluGluGluLeuAlaLysAlaArgGluGluLeuSerArgLeuGln                               445450455                                                                      CAAGAAACAGCAGAATTGGAGGAGAGTGTAGAGTCAGGGAAGGCTCAG1442                           GlnGluThrAlaGluLeuGluGluSerValGluSerGlyLysAlaGln                               460465470                                                                      TTGGAACCTCTTCAGCAGCACCTACAAGATTCACAACAGGAAATTAGT1490                           LeuGluProLeuGlnGlnHisLeuGlnAspSerGlnGlnGluIleSer                               475480485490                                                                   TCAATGCAAATGAAACTGATGGAAATGAAAGATTTGGAAAATCATAAT1538                           SerMetGlnMetLysLeuMetGluMetLysAspLeuGluAsnHisAsn                               495500505                                                                      AGTCAGTTAAATTGGTGCAGTAGCCCACACAGCATTCTTGTAAACGGA1586                           SerGlnLeuAsnTrpCysSerSerProHisSerIleLeuValAsnGly                               510515520                                                                      GCTACAGATTATTGCAGCCTCAGCACCAGCAGCAGTGAAACAGCCAAC1634                           AlaThrAspTyrCysSerLeuSerThrSerSerSerGluThrAlaAsn                               525530535                                                                      CTTAATGAACATGTTGAAGGCCAGAGCAACCTAGAGTCTGAGCCCATA1682                           LeuAsnGluHisValGluGlyGlnSerAsnLeuGluSerGluProIle                               540545550                                                                      CACCAGGAATCTCCAGCAAGAAGTAGTCCTGAACTACTGCCTTCTGGT1730                           HisGlnGluSerProAlaArgSerSerProGluLeuLeuProSerGly                               555560565570                                                                   GTGACTGATGAAAATGAGGTGACTACAGCTGTTACTGAAAAAGTTTGT1778                           ValThrAspGluAsnGluValThrThrAlaValThrGluLysValCys                               575580585                                                                      TCTGAACTCGACAATAATAGACATTCAAAAGAGGAAGATCCATTTAAT1826                           SerGluLeuAspAsnAsnArgHisSerLysGluGluAspProPheAsn                               590595600                                                                      GTAGACTCAAGTTCGCTGACAGGTCCAGTTGCAGATACAAACTTGGAT1874                           ValAspSerSerSerLeuThrGlyProValAlaAspThrAsnLeuAsp                               605610615                                                                      TTTTTCCAGTCTGATCCTTTTGTTGGCAGTGATCCTTTCAAGGATGAT1922                           PhePheGlnSerAspProPheValGlySerAspProPheLysAspAsp                               620625630                                                                      CCTTTTGGAAAAATCGATCCATTTGGTGGTGATCCTTTCAAAGGTTCA1970                           ProPheGlyLysIleAspProPheGlyGlyAspProPheLysGlySer                               635640645650                                                                   GATCCATTTGCATCAGACTGTTTCTTCAGGCAATCTACTGATCCTTTT2018                           AspProPheAlaSerAspCysPhePheArgGlnSerThrAspProPhe                               655660665                                                                      GCCACTTCAAGCACTGACCCTTTCAGTGCAGCCAACAATAGCAGTATT2066                           AlaThrSerSerThrAspProPheSerAlaAlaAsnAsnSerSerIle                               670675680                                                                      ACATCGGTAGAAACGTTGAAGCACAATGATCCTTTTGCTCCTGGTGGA2114                           ThrSerValGluThrLeuLysHisAsnAspProPheAlaProGlyGly                               685690695                                                                      ACAGTTGTTGCAGCAAGCGATTCAGCCACAGACCCCTTTGCTTCTGTT2162                           ThrValValAlaAlaSerAspSerAlaThrAspProPheAlaSerVal                               700705710                                                                      TTTGGGAATGAATCATTTGGAGGTGGATTTGCTGACTTCAGCACATTG2210                           PheGlyAsnGluSerPheGlyGlyGlyPheAlaAspPheSerThrLeu                               715720725730                                                                   TCAAAGGTCAACAATGAAGATCCTTTTCGTTCAGCCACATCGAGCTCT2258                           SerLysValAsnAsnGluAspProPheArgSerAlaThrSerSerSer                               735740745                                                                      GTCAGCAACGTAGTGATTACAAAAAATGTATTTGAGGAAACATCGGTC2306                           ValSerAsnValValIleThrLysAsnValPheGluGluThrSerVal                               750755760                                                                      AAAAGTGAAGATGAACCCCCAGCACTGCCACCAAAGATCGGAACTCCA2354                           LysSerGluAspGluProProAlaLeuProProLysIleGlyThrPro                               765770775                                                                      ACAAGACCCTGCCCTCTACCACCTGGGAAAAGATCCATCAACAAATTG2402                           ThrArgProCysProLeuProProGlyLysArgSerIleAsnLysLeu                               780785790                                                                      GATTCTCCTGATCCCTTTAAACTGAATGATCCATTTCAGCCTTTCCCA2450                           AspSerProAspProPheLysLeuAsnAspProPheGlnProPhePro                               795800805810                                                                   GGCAACGATAGCCCCAAAGAAAAAGATCCTGAAATGTTTTGTGATCCA2498                           GlyAsnAspSerProLysGluLysAspProGluMetPheCysAspPro                               815820825                                                                      TTCACTTCTGCTACTACCACTACCAATAAAGAGGCTGATCCAAGCAAT2546                           PheThrSerAlaThrThrThrThrAsnLysGluAlaAspProSerAsn                               830835840                                                                      TTTGCCAACTTCAGTGCTTATCCCTCTGAAGAAGATATGATCGAATGG2594                           PheAlaAsnPheSerAlaTyrProSerGluGluAspMetIleGluTrp                               845850855                                                                      GCCAAGAGGGAAAGTGAGAGAGAGGAAGAGCAGAGGCTTGCCCGACTA2642                           AlaLysArgGluSerGluArgGluGluGluGlnArgLeuAlaArgLeu                               860865870                                                                      AATCAGCAGGAACAAGAAGACTTAGAACTGGCTATTGCACTCAGCAAA2690                           AsnGlnGlnGluGlnGluAspLeuGluLeuAlaIleAlaLeuSerLys                               875880885890                                                                   TCTGAGATATCAGAAGCATGAAGAATTCTCTTGTTCTTTGGCAACAATA2739                          SerGluIleSerGluAla                                                             895                                                                            TAGTATTCTTCTTCCTGAATACTGAAACTATTTACAATGTGTATCAAAACTACCTGTGAG2799               CATGGGAATACAAAAGGTTTGAGATTCCTGTAAATGTGACAAAATTTTAGGATTTTTTTT2859               TTTTCTTCATTACAGATTCGTCTTTTTTTTTTTTTCTTATAAAAGCCGTAACCCAGTCAG2919               ACAAATTCACCTTCACTTAGGCCCCTGTTCTGGTATACATTTACTGTGAGCTTTTGCCTG2979               CCTGTGCTATTTTACTTGTAAAGCTAGAGCACCCAAGCTTCTGCCTTCTGGAATATAGAG3039               AAATAGTTTCACCCTGCACTACCCTGTTCTGTAGTTATTCTGATGATAGCCAGTGAGGTT3099               CTTAAAGTTTGCAGTATTCTCCCCTGATTGGAATGGTTGAGTGAGGGTAAGGGAAAGAAT3159               ATCTTATTTCTTTTATGATTGGTGCAAATTGGCTAAAGTGCATTTTTAAATTTCCTCTAC3219               TTAATTTGTTTTTCAGAGATAAGGAAAAATATTTTGCACAGATTTACTCCACTATGGAAA3279               AGGGATGCTGTAGGTTGAACCATTATAGCCTCAGATTCGATCTTTTCCTAACTAAAAATA3339               TTAAAGCCTCATGTGTGAAATAAATTTTTAAAAAGATTTATCTGGATTTAGAGAATTTTA3399               GATCAACAGATACCTCTCAGTGTGTTTGCTAATTAATAAAAATCAGTTTCTTACAAATAA3459               AGTTTGTAAGAAAATGTTCATTTTAAGTGATAGATAGTGGAGAAAATTTATCACCTAAAA3519               TATACCCATCAGTATAAGGCAAGCAAAAGTCTTAACATGGCAGCCATTCTGCCTTTGCCG3579               TGGCCCTGTCCTGTTTAGTTCTTAGTGGGTTAATTTTTGTACTTTTGCAGAAGAAACTTC3639               AGCAAGCTAGAACTGGAAGGTACTTTAATTTTTCATATATATTTGTTTTTTTTTTTTTAA3699               TGAAGGCTCATTTACTTGAAATGTAAAAACTTTCACTGAATACAAATAGAAAAAGTGATG3759               TGTTTTATATCATATTGCTTTTTGTCCATCTTTGTGGTTTAGTTTATTTACTCACTTCAT3819               GTTTTTCACCTATAAAATTGTCAAGCTAGCAAAAAAACTCTTGTTTTTTTAATTGGGAGA3879               GAAGAGACCTGCCAGATTATCAGACCTCTTCATGTTAAAAGACCATCTCCTGTAAAACTG3939               ACCTAGTGGACAAGCTGAATTTGAAATAGACTGTGAAGTAAGCTGTAACTTGTCATTTTA3999               ATTTTGTTTAACACGGTTACTGACTTAGATGATGTATTAAATACCAAGATAAAGAAAAAT4059               GCACCTAAAATCTAATTAGAATTCTCTGGGTCACCAAGTCAAGGTGGTATTGATCTGTGT4119               TAATCTGAGTAACTTATTGCCTAGCCTATAAATAAATTCCAATATC4165                             (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 896 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        MetAlaAlaAlaAlaGlnLeuSerLeuThrGlnLeuSerSerGlyAsn                               151015                                                                         ProValTyrGluLysTyrTyrArgGlnValAspThrGlyAsnThrGly                               202530                                                                         ArgValLeuAlaSerAspAlaAlaAlaPheLeuLysLysSerGlyLeu                               354045                                                                         ProAspLeuIleLeuGlyLysIleTrpAspLeuAlaAspThrAspGly                               505560                                                                         LysGlyIleLeuAsnLysGlnGluPhePheValAlaLeuArgLeuVal                               65707580                                                                       AlaCysAlaGlnAsnGlyLeuGluValSerLeuSerSerLeuAsnLeu                               859095                                                                         AlaValProProProArgPheHisAspThrSerSerProLeuLeuIle                               100105110                                                                      SerGlyThrSerAlaAlaGluLeuProTrpAlaValLysProGluAsp                               115120125                                                                      LysAlaLysTyrAspAlaIlePheAspSerLeuSerProValAsnGly                               130135140                                                                      PheLeuSerGlyAspLysValLysProValLeuLeuAsnSerLysLeu                               145150155160                                                                   ProValAspIleLeuGlyArgValTrpGluLeuSerAspIleAspHis                               165170175                                                                      AspGlyMetLeuAspArgAspGluPheAlaValAlaMetPheLeuVal                               180185190                                                                      TyrCysAlaLeuGluLysGluProValProMetSerLeuProProAla                               195200205                                                                      LeuValProProSerLysArgLysThrTrpValValSerProAlaGlu                               210215220                                                                      LysAlaLysTyrAspGluIlePheLeuLysThrAspLysAspMetAsp                               225230235240                                                                   GlyPheValSerGlyLeuGluValArgGluIlePheLeuLysThrGly                               245250255                                                                      LeuProSerThrLeuLeuAlaHisIleTrpSerLeuCysAspThrLys                               260265270                                                                      AspCysGlyLysLeuSerLysAspGlnPheAlaLeuAlaPheHisLeu                               275280285                                                                      IleSerGlnLysLeuIleLysGlyIleAspProProHisValLeuThr                               290295300                                                                      ProGluMetIleProProSerAspArgAlaSerLeuGlnLysAsnIle                               305310315320                                                                   IleGlySerSerProValAlaAspPheSerAlaIleLysGluLeuAsp                               325330335                                                                      ThrLeuAsnAsnGluIleValAspLeuGlnArgGluLysAsnAsnVal                               340345350                                                                      GluGlnAspLeuLysGluLysGluAspThrIleLysGlnArgThrSer                               355360365                                                                      GluValGlnAspLeuGlnAspGluValGlnArgGluAsnThrAsnLeu                               370375380                                                                      GlnLysLeuGlnAlaGlnLysGlnGlnValGlnGluLeuLeuAspGlu                               385390395400                                                                   LeuAspGluGlnLysAlaGlnLeuGluGluGlnLeuLysGluValArg                               405410415                                                                      LysLysCysAlaGluGluAlaGlnLeuIleSerSerLeuLysAlaGlu                               420425430                                                                      LeuThrSerGlnGluSerGlnIleSerThrTyrGluGluGluLeuAla                               435440445                                                                      LysAlaArgGluGluLeuSerArgLeuGlnGlnGluThrAlaGluLeu                               450455460                                                                      GluGluSerValGluSerGlyLysAlaGlnLeuGluProLeuGlnGln                               465470475480                                                                   HisLeuGlnAspSerGlnGlnGluIleSerSerMetGlnMetLysLeu                               485490495                                                                      MetGluMetLysAspLeuGluAsnHisAsnSerGlnLeuAsnTrpCys                               500505510                                                                      SerSerProHisSerIleLeuValAsnGlyAlaThrAspTyrCysSer                               515520525                                                                      LeuSerThrSerSerSerGluThrAlaAsnLeuAsnGluHisValGlu                               530535540                                                                      GlyGlnSerAsnLeuGluSerGluProIleHisGlnGluSerProAla                               545550555560                                                                   ArgSerSerProGluLeuLeuProSerGlyValThrAspGluAsnGlu                               565570575                                                                      ValThrThrAlaValThrGluLysValCysSerGluLeuAspAsnAsn                               580585590                                                                      ArgHisSerLysGluGluAspProPheAsnValAspSerSerSerLeu                               595600605                                                                      ThrGlyProValAlaAspThrAsnLeuAspPhePheGlnSerAspPro                               610615620                                                                      PheValGlySerAspProPheLysAspAspProPheGlyLysIleAsp                               625630635640                                                                   ProPheGlyGlyAspProPheLysGlySerAspProPheAlaSerAsp                               645650655                                                                      CysPhePheArgGlnSerThrAspProPheAlaThrSerSerThrAsp                               660665670                                                                      ProPheSerAlaAlaAsnAsnSerSerIleThrSerValGluThrLeu                               675680685                                                                      LysHisAsnAspProPheAlaProGlyGlyThrValValAlaAlaSer                               690695700                                                                      AspSerAlaThrAspProPheAlaSerValPheGlyAsnGluSerPhe                               705710715720                                                                   GlyGlyGlyPheAlaAspPheSerThrLeuSerLysValAsnAsnGlu                               725730735                                                                      AspProPheArgSerAlaThrSerSerSerValSerAsnValValIle                               740745750                                                                      ThrLysAsnValPheGluGluThrSerValLysSerGluAspGluPro                               755760765                                                                      ProAlaLeuProProLysIleGlyThrProThrArgProCysProLeu                               770775780                                                                      ProProGlyLysArgSerIleAsnLysLeuAspSerProAspProPhe                               785790795800                                                                   LysLeuAsnAspProPheGlnProPheProGlyAsnAspSerProLys                               805810815                                                                      GluLysAspProGluMetPheCysAspProPheThrSerAlaThrThr                               820825830                                                                      ThrThrAsnLysGluAlaAspProSerAsnPheAlaAsnPheSerAla                               835840845                                                                      TyrProSerGluGluAspMetIleGluTrpAlaLysArgGluSerGlu                               850855860                                                                      ArgGluGluGluGlnArgLeuAlaArgLeuAsnGlnGlnGluGlnGlu                               865870875880                                                                   AspLeuGluLeuAlaIleAlaLeuSerLysSerGluIleSerGluAla                               885890895                                                                      (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 3033 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 111..2802                                                        (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        CCGAGACGCCCGGCGCCGGCCTCGCCTACGGCCTGTCCCTCCGCCTCCTTCCCGCCCCGG60                 GCCCCCGTCCGTCCGTCCTTCCTTCCCCTCCCGTGCATGATGGAAACACCATGGCT116                    MetAla                                                                         GCGGCAGCCCAGCTCTCCCTGACACAGTTGTCAAGTGGGAATCCTGTA164                            AlaAlaAlaGlnLeuSerLeuThrGlnLeuSerSerGlyAsnProVal                               51015                                                                          TATGAAAAATACTACAGACAGGTTGAGGCAGGCAATACTGGAAGGGTG212                            TyrGluLysTyrTyrArgGlnValGluAlaGlyAsnThrGlyArgVal                               202530                                                                         TTGGCGTTAGATGCTGCTGCATTCCTGAAAAAGTCAGGGCTTCCAGAC260                            LeuAlaLeuAspAlaAlaAlaPheLeuLysLysSerGlyLeuProAsp                               35404550                                                                       TTGATTCTTGGAAAGATTTGGGATTTAGCTGACACAGATGGCAAAGGT308                            LeuIleLeuGlyLysIleTrpAspLeuAlaAspThrAspGlyLysGly                               556065                                                                         GTCCTGAGCAAACAAGAATTCTTTGTTGCTTTACGGCTTGTGGCATGT356                            ValLeuSerLysGlnGluPhePheValAlaLeuArgLeuValAlaCys                               707580                                                                         GCTCAGAATGGACTGGAAGTTTCACTGAGTAGCCTAAGTCTGGCTGTT404                            AlaGlnAsnGlyLeuGluValSerLeuSerSerLeuSerLeuAlaVal                               859095                                                                         CCTCCACCAAGATTTCATGACTCCAGCAGTCCGTTGCTAACCAGTGGG452                            ProProProArgPheHisAspSerSerSerProLeuLeuThrSerGly                               100105110                                                                      CCCTCAGTTGCTGAGCTCCCGTGGGCTGTAAAGTCTGAAGATAAAGCC500                            ProSerValAlaGluLeuProTrpAlaValLysSerGluAspLysAla                               115120125130                                                                   AAATATGATGCAATTTTTGACAGTTTAAGCCCAGTGGATGGATTTTTG548                            LysTyrAspAlaIlePheAspSerLeuSerProValAspGlyPheLeu                               135140145                                                                      TCTGGTGATAAAGTGAAACCAGTGTTGCTCAACTCTAAGTTACCTGTG596                            SerGlyAspLysValLysProValLeuLeuAsnSerLysLeuProVal                               150155160                                                                      GAAATCCTTGGAAGAGTTTGGGAGTTGAGTGATATTGACCACGATGGA644                            GluIleLeuGlyArgValTrpGluLeuSerAspIleAspHisAspGly                               165170175                                                                      AAGCTGGACAGAGATGAGTTTGCAGTTGCCATGTTTTTGGTATACTGT692                            LysLeuAspArgAspGluPheAlaValAlaMetPheLeuValTyrCys                               180185190                                                                      GCACTGGAGAAAGAACCTGTGCCAATGTCCTTGCCTCCAGCCTTGGTG740                            AlaLeuGluLysGluProValProMetSerLeuProProAlaLeuVal                               195200205210                                                                   CCACCTTCTAAGAGAAAAACGTGGGTTGTATCCCCTGCAGAAAAAGCT788                            ProProSerLysArgLysThrTrpValValSerProAlaGluLysAla                               215220225                                                                      AAATATGATGAAATTTTTCTGAAAACTGATAAGGATATGGATGGATAT836                            LysTyrAspGluIlePheLeuLysThrAspLysAspMetAspGlyTyr                               230235240                                                                      GTGTCTGGACTGGAGGTCCGTGAAACCTTCCTGAAAACAGGTTTACCT884                            ValSerGlyLeuGluValArgGluThrPheLeuLysThrGlyLeuPro                               245250255                                                                      TCTGCCTTGCTAGCCCACATTTGGTCACTATGTGACACAAAGGGCTGT932                            SerAlaLeuLeuAlaHisIleTrpSerLeuCysAspThrLysGlyCys                               260265270                                                                      GGGAAGCTTTCAAAAGACCAGTTTGCCTTGGCTTTTCACTTAATCAAT980                            GlyLysLeuSerLysAspGlnPheAlaLeuAlaPheHisLeuIleAsn                               275280285290                                                                   CAGAAGTTAATAAAAGGCATTGACCCTCCTCATAGTCTCACTCCTGAG1028                           GlnLysLeuIleLysGlyIleAspProProHisSerLeuThrProGlu                               295300305                                                                      ATGATTCCACCATCAGACAGATCCAGTTTACAAAAGAACATCACAGGA1076                           MetIleProProSerAspArgSerSerLeuGlnLysAsnIleThrGly                               310315320                                                                      TCAAGTCCTGTTGCAGATTTTTCTGCTATTAAGGAACTAGATACCCTT1124                           SerSerProValAlaAspPheSerAlaIleLysGluLeuAspThrLeu                               325330335                                                                      AACAATGAAATAGTTGACCTGCAGAGGGAAAAGAACAATGTGGAGCAG1172                           AsnAsnGluIleValAspLeuGlnArgGluLysAsnAsnValGluGln                               340345350                                                                      GACCTTAAAGAGAAGGAAGACACAGTTAAGCAGAGGACCAGTGAGGTT1220                           AspLeuLysGluLysGluAspThrValLysGlnArgThrSerGluVal                               355360365370                                                                   CAGGATCTTCAAGATGAAGTTCAAAGGGAGAGTATTAATCTACAAAAA1268                           GlnAspLeuGlnAspGluValGlnArgGluSerIleAsnLeuGlnLys                               375380385                                                                      CTGCAGGCCCAGAAGCAGCAGGTGCAGGAGCTCCTGGGTGAACTGGAT1316                           LeuGlnAlaGlnLysGlnGlnValGlnGluLeuLeuGlyGluLeuAsp                               390395400                                                                      GAGCAGAAAGCCCAGCTGGAGGAGCAGCTCCAGGAAGTCAGGAAAAAG1364                           GluGlnLysAlaGlnLeuGluGluGlnLeuGlnGluValArgLysLys                               405410415                                                                      TGTGCTGAGGAGGCCCAGCTGATTTCTTCCCTGAAAGCAGAAATAACT1412                           CysAlaGluGluAlaGlnLeuIleSerSerLeuLysAlaGluIleThr                               420425430                                                                      AGTCAAGAATCTCAGATCTCCAGTTATGAGGAAGAACTGTTGAAAGCT1460                           SerGlnGluSerGlnIleSerSerTyrGluGluGluLeuLeuLysAla                               435440445450                                                                   AGAGAAGAACTAAGTCGCCTACAACAAGAAACAGCACAATTGGAAGAA1508                           ArgGluGluLeuSerArgLeuGlnGlnGluThrAlaGlnLeuGluGlu                               455460465                                                                      AGTGTGGAGTCAGGGAAGGCTCAGCTGGAACCTCTTCAGCAGCACCTA1556                           SerValGluSerGlyLysAlaGlnLeuGluProLeuGlnGlnHisLeu                               470475480                                                                      CAAGAGTCACAACAGGAAATCAGCTCAATGCAAATGAGATTGGAAATG1604                           GlnGluSerGlnGlnGluIleSerSerMetGlnMetArgLeuGluMet                               485490495                                                                      AAAGATCTGGAAACTGATAATAACCAATCAAATTGGAGCAGTAGCCCA1652                           LysAspLeuGluThrAspAsnAsnGlnSerAsnTrpSerSerSerPro                               500505510                                                                      CAAAGCGTTCTTGTTAATGGTGCTACAGATTACTGTAGCCTCAGCACC1700                           GlnSerValLeuValAsnGlyAlaThrAspTyrCysSerLeuSerThr                               515520525530                                                                   AGCAGCAGTGAAACAGCCAACTTCAACGAACATGCTGAAGGCCAAAAC1748                           SerSerSerGluThrAlaAsnPheAsnGluHisAlaGluGlyGlnAsn                               535540545                                                                      AACCTAGAGTCTGAACCCACACACCAGGAGTCCTCAGTAAGAAGTAGT1796                           AsnLeuGluSerGluProThrHisGlnGluSerSerValArgSerSer                               550555560                                                                      CCTGAAATCGCACCTTCTGATGTGACTGATGAAAGTGAGGCTGTGACT1844                           ProGluIleAlaProSerAspValThrAspGluSerGluAlaValThr                               565570575                                                                      GTGGCTGGTAATGAGAAAGTTACTCCGAGATTTGACGATGACAAGCAC1892                           ValAlaGlyAsnGluLysValThrProArgPheAspAspAspLysHis                               580585590                                                                      TCAAAAGAGGAAGATCCATTTAATGTAGAATCAAGTTCACTGACAGAT1940                           SerLysGluGluAspProPheAsnValGluSerSerSerLeuThrAsp                               595600605610                                                                   GCAGTTGCAGATACAAACTTGGATTTTTTCCAGTCTGATCCTTTTGTT1988                           AlaValAlaAspThrAsnLeuAspPhePheGlnSerAspProPheVal                               615620625                                                                      GGCAGTGATCCTTTCAAGGATGATCCTTTTGGAAAAATTGATCCATTT2036                           GlySerAspProPheLysAspAspProPheGlyLysIleAspProPhe                               630635640                                                                      GGTGGTGACCCTTTCAAAGGCTCAGATCCTTTTGCGTCTGATTGCTTC2084                           GlyGlyAspProPheLysGlySerAspProPheAlaSerAspCysPhe                               645650655                                                                      TTTAAGCAGACTTCTACTGATCCTTTTACCACTTCAAGTACGGACCCT2132                           PheLysGlnThrSerThrAspProPheThrThrSerSerThrAspPro                               660665670                                                                      TTCAGTGCATCCAGCAACAGCAGTAATACATCGGTAGAAACTTGGAAG2180                           PheSerAlaSerSerAsnSerSerAsnThrSerValGluThrTrpLys                               675680685690                                                                   CACAATGACCCATTTGCTCCTGGTGGAACAGTTGTTGCTGCAGCGAGT2228                           HisAsnAspProPheAlaProGlyGlyThrValValAlaAlaAlaSer                               695700705                                                                      GATTCAGCCACAGACCCTTTTGCTTCTGTTTTCGGAAATGAATCATTT2276                           AspSerAlaThrAspProPheAlaSerValPheGlyAsnGluSerPhe                               710715720                                                                      GGAGATGGATTTGCTGACTTCAGCACATTATCAAAGGTCAACAATGAA2324                           GlyAspGlyPheAlaAspPheSerThrLeuSerLysValAsnAsnGlu                               725730735                                                                      GATGCTTTTAATCCTACCATATCAAGTTCTACCAGCAGTGTGACCATT2372                           AspAlaPheAsnProThrIleSerSerSerThrSerSerValThrIle                               740745750                                                                      GCAAAACCTATGTTAGAGGAAACAGCCAGCAAGAGTGAAGATGTGCCT2420                           AlaLysProMetLeuGluGluThrAlaSerLysSerGluAspValPro                               755760765770                                                                   CCAGCACTGCCGCCCAAAGTTGGCACTCCAACAAGACCTTGCCCGCCA2468                           ProAlaLeuProProLysValGlyThrProThrArgProCysProPro                               775780785                                                                      CCCCCTGGGAAAAGACCCATCAACAAATTGGATTCTTCTGATCCCCTT2516                           ProProGlyLysArgProIleAsnLysLeuAspSerSerAspProLeu                               790795800                                                                      AAACTGAATGATCCATTTCAGCCTTTCCCAGGCAATGATAGTCCCAAA2564                           LysLeuAsnAspProPheGlnProPheProGlyAsnAspSerProLys                               805810815                                                                      GAAAAAGATCCTGATATGTTTTGTGATCCATTCACTTCTTCTACCACT2612                           GluLysAspProAspMetPheCysAspProPheThrSerSerThrThr                               820825830                                                                      ACCAATAAAGAGGCTGACCCAAGCAATTTTGCTAACTTCAGTGCTTAT2660                           ThrAsnLysGluAlaAspProSerAsnPheAlaAsnPheSerAlaTyr                               835840845850                                                                   CCCTCTGAAGAAGATATGATTGAATGGGCAAAAAGGGAAAGTGAGCGG2708                           ProSerGluGluAspMetIleGluTrpAlaLysArgGluSerGluArg                               855860865                                                                      GAAGAAGAACAGAGGCTTGCCAGACTAAATCAGCAGGAGCAAGAAGAC2756                           GluGluGluGlnArgLeuAlaArgLeuAsnGlnGlnGluGlnGluAsp                               870875880                                                                      TTGGAACTGGCCATTGCACTTAGCAAATCTGAGATCTCAGAAGCAT2802                             LeuGluLeuAlaIleAlaLeuSerLysSerGluIleSerGluAla                                  885890895                                                                      GAAGAGTTATCTGTCCTTTGTCAGCAGTACAGTGCTCTCTGGAACACTGAAGCTATTTAC2862               CATGTGCATCAAACTACCTATGAGCATGGGATACAAAAGGTTTGAGATTCCTAGAAATGT2922               GACAAAAGTCTAGTTTGTTTTTTTTTTTTTTTTTGGGGGGGGGTGCTATTTCAAATGTGT2982               CTTTTATTTTTTCTTCCAAAAGCAGTACCCTAATTAAACGGCTTTGCCTAG3033                        (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 897 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        MetAlaAlaAlaAlaGlnLeuSerLeuThrGlnLeuSerSerGlyAsn                               151015                                                                         ProValTyrGluLysTyrTyrArgGlnValGluAlaGlyAsnThrGly                               202530                                                                         ArgValLeuAlaLeuAspAlaAlaAlaPheLeuLysLysSerGlyLeu                               354045                                                                         ProAspLeuIleLeuGlyLysIleTrpAspLeuAlaAspThrAspGly                               505560                                                                         LysGlyValLeuSerLysGlnGluPhePheValAlaLeuArgLeuVal                               65707580                                                                       AlaCysAlaGlnAsnGlyLeuGluValSerLeuSerSerLeuSerLeu                               859095                                                                         AlaValProProProArgPheHisAspSerSerSerProLeuLeuThr                               100105110                                                                      SerGlyProSerValAlaGluLeuProTrpAlaValLysSerGluAsp                               115120125                                                                      LysAlaLysTyrAspAlaIlePheAspSerLeuSerProValAspGly                               130135140                                                                      PheLeuSerGlyAspLysValLysProValLeuLeuAsnSerLysLeu                               145150155160                                                                   ProValGluIleLeuGlyArgValTrpGluLeuSerAspIleAspHis                               165170175                                                                      AspGlyLysLeuAspArgAspGluPheAlaValAlaMetPheLeuVal                               180185190                                                                      TyrCysAlaLeuGluLysGluProValProMetSerLeuProProAla                               195200205                                                                      LeuValProProSerLysArgLysThrTrpValValSerProAlaGlu                               210215220                                                                      LysAlaLysTyrAspGluIlePheLeuLysThrAspLysAspMetAsp                               225230235240                                                                   GlyTyrValSerGlyLeuGluValArgGluThrPheLeuLysThrGly                               245250255                                                                      LeuProSerAlaLeuLeuAlaHisIleTrpSerLeuCysAspThrLys                               260265270                                                                      GlyCysGlyLysLeuSerLysAspGlnPheAlaLeuAlaPheHisLeu                               275280285                                                                      IleAsnGlnLysLeuIleLysGlyIleAspProProHisSerLeuThr                               290295300                                                                      ProGluMetIleProProSerAspArgSerSerLeuGlnLysAsnIle                               305310315320                                                                   ThrGlySerSerProValAlaAspPheSerAlaIleLysGluLeuAsp                               325330335                                                                      ThrLeuAsnAsnGluIleValAspLeuGlnArgGluLysAsnAsnVal                               340345350                                                                      GluGlnAspLeuLysGluLysGluAspThrValLysGlnArgThrSer                               355360365                                                                      GluValGlnAspLeuGlnAspGluValGlnArgGluSerIleAsnLeu                               370375380                                                                      GlnLysLeuGlnAlaGlnLysGlnGlnValGlnGluLeuLeuGlyGlu                               385390395400                                                                   LeuAspGluGlnLysAlaGlnLeuGluGluGlnLeuGlnGluValArg                               405410415                                                                      LysLysCysAlaGluGluAlaGlnLeuIleSerSerLeuLysAlaGlu                               420425430                                                                      IleThrSerGlnGluSerGlnIleSerSerTyrGluGluGluLeuLeu                               435440445                                                                      LysAlaArgGluGluLeuSerArgLeuGlnGlnGluThrAlaGlnLeu                               450455460                                                                      GluGluSerValGluSerGlyLysAlaGlnLeuGluProLeuGlnGln                               465470475480                                                                   HisLeuGlnGluSerGlnGlnGluIleSerSerMetGlnMetArgLeu                               485490495                                                                      GluMetLysAspLeuGluThrAspAsnAsnGlnSerAsnTrpSerSer                               500505510                                                                      SerProGlnSerValLeuValAsnGlyAlaThrAspTyrCysSerLeu                               515520525                                                                      SerThrSerSerSerGluThrAlaAsnPheAsnGluHisAlaGluGly                               530535540                                                                      GlnAsnAsnLeuGluSerGluProThrHisGlnGluSerSerValArg                               545550555560                                                                   SerSerProGluIleAlaProSerAspValThrAspGluSerGluAla                               565570575                                                                      ValThrValAlaGlyAsnGluLysValThrProArgPheAspAspAsp                               580585590                                                                      LysHisSerLysGluGluAspProPheAsnValGluSerSerSerLeu                               595600605                                                                      ThrAspAlaValAlaAspThrAsnLeuAspPhePheGlnSerAspPro                               610615620                                                                      PheValGlySerAspProPheLysAspAspProPheGlyLysIleAsp                               625630635640                                                                   ProPheGlyGlyAspProPheLysGlySerAspProPheAlaSerAsp                               645650655                                                                      CysPhePheLysGlnThrSerThrAspProPheThrThrSerSerThr                               660665670                                                                      AspProPheSerAlaSerSerAsnSerSerAsnThrSerValGluThr                               675680685                                                                      TrpLysHisAsnAspProPheAlaProGlyGlyThrValValAlaAla                               690695700                                                                      AlaSerAspSerAlaThrAspProPheAlaSerValPheGlyAsnGlu                               705710715720                                                                   SerPheGlyAspGlyPheAlaAspPheSerThrLeuSerLysValAsn                               725730735                                                                      AsnGluAspAlaPheAsnProThrIleSerSerSerThrSerSerVal                               740745750                                                                      ThrIleAlaLysProMetLeuGluGluThrAlaSerLysSerGluAsp                               755760765                                                                      ValProProAlaLeuProProLysValGlyThrProThrArgProCys                               770775780                                                                      ProProProProGlyLysArgProIleAsnLysLeuAspSerSerAsp                               785790795800                                                                   ProLeuLysLeuAsnAspProPheGlnProPheProGlyAsnAspSer                               805810815                                                                      ProLysGluLysAspProAspMetPheCysAspProPheThrSerSer                               820825830                                                                      ThrThrThrAsnLysGluAlaAspProSerAsnPheAlaAsnPheSer                               835840845                                                                      AlaTyrProSerGluGluAspMetIleGluTrpAlaLysArgGluSer                               850855860                                                                      GluArgGluGluGluGlnArgLeuAlaArgLeuAsnGlnGlnGluGln                               865870875880                                                                   GluAspLeuGluLeuAlaIleAlaLeuSerLysSerGluIleSerGlu                               885890895                                                                      Ala                                                                            __________________________________________________________________________ 

What is claimed is:
 1. A purified eps15 polypeptide, wherein said eps15 polypeptide has biological activity as a substrate in the pathway of the Epidermal Growth Factor Receptor (EGFR) kinase as evidenced by tyrosine phosphorylation of said eps15 polypeptide or mitogen stimulation by said eps15 polypeptide upon activation of said EGFR kinase, comprising the amino acid sequence of SEQ ID NO:3 or SEQ ID NO:4, or the amino acid sequence encoded by DNA that hybridizes under low stringency conditions to the protein-encoding domain of SEQ ID NO:1 or SEQ ID NO:2.
 2. A purified human eps15 polypeptide comprising the amino acid sequence of SEQ ID NO:2.
 3. A purified murine eps15 polypeptide comprising the amino acid sequence of SEQ ID NO:4.
 4. A biologically active polypeptide fragment of the eps15 polypeptide of claim 1, having the same properties as said eps15 polypeptide as evidenced by said tyrosine phosphorylation or said mitogen stimulation.
 5. A biologically active conservative variant of the eps15 polypeptide of claim 1, having the same properties as said eps15 polypeptide as evidenced by said tyrosine phosphorylation or said mitogen stimulation.
 6. The purified eps15 polypeptide of claim 1, in a concentration of at least 0.01% by weight.
 7. The purified eps15 polypeptide of claim 1, in a concentration of at least 0.1% by weight.
 8. The purified eps15 polypeptide of claim 1, in a concentration of at least 0.5% by weight.
 9. The purified eps15 polypeptide of claim 1, in a concentration of at least 1% by weight.
 10. The purified eps15 polypeptide of claim 1, in a concentration of at least 5% by weight.
 11. The purified eps15 polypeptide of claim 1, in a concentration of at least 10% by weight.
 12. The purified eps15 polypeptide of claim 1, in a concentration of at least 20% by weight. 